NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Retrospective for the Dynamic Sensorium Competition for predicting large-scale mouse primary visual cortex activity from videos

Turishcheva, Polina; Fahey, Paul G; Vystrčilová, Michaela; Hansel, Laura; Froebe, Rachel E; Ponder, Kayla; Qiu, Yongrong; Willeke, Konstantin Friedrich; Bashiri, Mohammad; Baikulov, Ruslan; et al (November 2024, NeurIPS 2024)

Full Text Available
Bidirectional Language Models Are Also Few-shot Learners

Patel, Ajay; Li, Bryan; Rasooli, Mohammad Sadegh; Constant, Noah; Raffel, Colin; Callison-Burch, Chris (May 2023, The Eleventh International Conference on Learning Representations (ICLR 2023))

Large language models such as GPT-3 (Brown et al., 2020) can perform arbitrary tasks without undergoing fine-tuning after being prompted with only a few labeled examples. An arbitrary task can be reformulated as a natural language prompt, and a language model can be asked to generate the completion, indirectly performing the task in a paradigm known as prompt-based learning. To date, emergent prompt-based learning capabilities have mainly been demonstrated for unidirectional language models. However, bidirectional language models pre-trained on denoising objectives such as masked language modeling produce stronger learned representations for transfer learning. This motivates the possibility of prompting bidirectional models, but their pre-training objectives have made them largely incompatible with the existing prompting paradigm. We present SAP (Sequential Autoregressive Prompting), a technique that enables the prompting of bidirectional models. Utilizing the machine translation task as a case study, we prompt the bidirectional mT5 model (Xue et al., 2021) with SAP and demonstrate its few-shot and zero-shot translations outperform the few-shot translations of unidirectional models like GPT-3 and XGLM (Lin et al., 2021), despite mT5's approximately 50% fewer parameters. We further show SAP is effective on question answering and summarization. For the first time, our results demonstrate prompt-based learning is an emergent property of a broader class of language models, rather than only unidirectional models.
more » « less
Full Text Available
Enhancing Human Summaries for Question-Answer Generation in Education

https://doi.org/10.18653/v1/2023.bea-1.9

Gonzalez, Hannah; Dugan, Liam; Miltsakaki, Eleni; Cui, Zhiqi; Ren, Jiaxuan; Li, Bryan; Upadhyay, Shriyash; Ginsberg, Etan; Callison-Burch, Chris (July 2023, Proceedings of the 18th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2023))

We address the problem of generating high-quality question-answer pairs for educational materials. Previous work on this problem showed that using summaries as input improves the quality of question generation (QG) over original textbook text and that human-written summaries result in higher quality QG than automatic summaries. In this paper, a) we show that advances in Large Language Models (LLMs) are not yet sufficient to generate quality summaries for QG and b) we introduce a new methodology for enhancing bullet point student notes into fully fledged summaries and find that our methodology yields higher quality QG. We conducted a large-scale human annotation study of generated question-answer pairs for the evaluation of our methodology. In order to aid in future research, we release a new dataset of 9.2K human annotations of generated questions.
more » « less
Full Text Available
Prosody Prediction from Syntactic, Lexical, and Word Embedding Features

https://doi.org/10.21437/SSW.2019-48

Sloan, Rose; Akhtar, Syed Sarfaraz; Li, Bryan; Shrivastava, Ritvik; Gravano, Agustin; Hirschberg, Julia (September 2019, 10th ISCA Speech Synthesis Workshop)

Accurate prosody prediction from text leads to more natural-sounding TTS. In this work, we employ a new set of fea- tures to predict ToBI pitch accent and phrase boundaries from text. We investigate a wide variety of text-based features, in- cluding many new syntactic features, several types of word em- beddings, co-reference features, LIWC features, and specificity information. We focus our work on the Boston Radio News Corpus, a ToBI-labeled corpus of relatively clean news broad- casts, but also test our classifiers on Audix, a smaller corpus of read news, and on the Columbia Games Corpus, a corpus of conversational speech, in order to test the applicability of our model in cross-corpus settings. Our results show strong per- formance on both tasks, as well as some promising results for cross-corpus applications of our models.
more » « less
Full Text Available

Search for: All records